Speaker Diarization in Personal Video Recordings Based on LDA and User Feedback

Author

  • Zeenat Afroze
Abstract

In this paper, we present a speaker diarization system for personal video recordings. Speaker diarization begins with the extraction of relevant features from the input signal. Features are measurable characteristics that are important for distinguishing between different classes. They should have low inter-class similarity and low intra-class variability, so LDA is used to cope with intra-speaker variability. We demonstrate an improvement in performance over the baseline system through the use of LDA. This paper also reports the performance of the speaker diarization system. A new approach, called user feedback, has been developed to further improve the performance of the speaker diarization system.
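
The feature criterion stated in the abstract (classes far apart, little spread within each class) can be illustrated with a small sketch. It assumes that LDA here denotes linear discriminant analysis and uses the two-class Fisher form on invented 2-D features; the data, function names, and dimensions are hypothetical and are not taken from the paper's system.

```python
# Toy two-class Fisher LDA in pure Python (no external dependencies).
# All data and names below are hypothetical, for illustration only.

def mean2(points):
    """Mean of a list of 2-D points."""
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)

def scatter2(points, mu):
    """Entries (s00, s01, s11) of the symmetric 2x2 scatter matrix around mu."""
    s00 = s01 = s11 = 0.0
    for x, y in points:
        dx, dy = x - mu[0], y - mu[1]
        s00 += dx * dx
        s01 += dx * dy
        s11 += dy * dy
    return s00, s01, s11

def fisher_direction(class_a, class_b):
    """Direction w = Sw^{-1} (mu_b - mu_a), where Sw is the within-class scatter."""
    mu_a, mu_b = mean2(class_a), mean2(class_b)
    a00, a01, a11 = scatter2(class_a, mu_a)
    b00, b01, b11 = scatter2(class_b, mu_b)
    s00, s01, s11 = a00 + b00, a01 + b01, a11 + b11
    det = s00 * s11 - s01 * s01
    if det == 0:
        raise ValueError("within-class scatter matrix is singular")
    dx, dy = mu_b[0] - mu_a[0], mu_b[1] - mu_a[1]
    # Closed-form inverse of a symmetric 2x2 matrix applied to the mean difference.
    return ((s11 * dx - s01 * dy) / det, (s00 * dy - s01 * dx) / det)

def fisher_ratio(xs, ys):
    """Squared distance between class means divided by total within-class spread (1-D)."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    within = sum((x - mx) ** 2 for x in xs) + sum((y - my) ** 2 for y in ys)
    return (mx - my) ** 2 / within

# Axis 0 varies a lot within each speaker but does not separate the speakers;
# axis 1 separates them with little within-speaker spread.
speaker_a = [(0.0, 1.0), (2.0, 1.2), (4.0, 0.8), (6.0, 1.0)]
speaker_b = [(0.0, 2.0), (2.0, 2.2), (4.0, 1.8), (6.0, 2.0)]

w = fisher_direction(speaker_a, speaker_b)
proj_a = [x * w[0] + y * w[1] for x, y in speaker_a]
proj_b = [x * w[0] + y * w[1] for x, y in speaker_b]

ratio_axis0 = fisher_ratio([p[0] for p in speaker_a], [p[0] for p in speaker_b])
ratio_axis1 = fisher_ratio([p[1] for p in speaker_a], [p[1] for p in speaker_b])
ratio_lda = fisher_ratio(proj_a, proj_b)

print("direction:", w)
print("ratios (axis 0, axis 1, LDA projection):", ratio_axis0, ratio_axis1, ratio_lda)
```

In this toy setup the learned direction leans on the axis that separates the two speakers and down-weights the axis that only carries within-speaker variation. A real diarization front end would apply the same idea to higher-dimensional acoustic features and more than two speakers.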

Similar resources

Using Weighted Oriented Optical Flow Histograms for Multimodal Speaker Diarization

Speaker diarization currently focuses on using audio features to partition an audio stream into speaker-homogeneous speech regions, in other words to determine “who spoke when”. Recent speaker diarization corpora contain video recordings in addition to the commonly used audio. Thus, we investigated the benefits of incorporating video features, namely histograms of weighted oriented optical flo...

A Survey on Speaker Diarization Approach for Audio and Video Content Retrieval

Speaker diarization is the task of determining “who spoke when?” in an audio or video recording that contains an unknown amount of speech and an unknown number of speakers. Speaker diarization methods can be used to determine the speech and non-speech parts of the recordings. Different approaches can be evaluated for speaker diarization. Accordingly, many important imp...

Believable Visual Feedback in Motor Learning Using Occlusion-based Clipping in Video Mapping

Gait rehabilitation systems provide patients with guidance and feedback that help them perform rehabilitation tasks better. Real-time feedback can guide users to correct their movements. Research has shown that the quality of feedback is crucial for enhancing motor learning in physical rehabilitation. Common feedback systems based on virtual reality present interactive feedback in a monit...

An iterative speaker re-diarization scheme for improving speaker-based entity extraction in multimedia archives

In this paper we present a novel scheme for improving speaker diarization by making use of repeating speakers across multiple recordings within a large corpus. We call this technique speaker re-diarization and demonstrate that it is possible to reuse the initial speaker-linked diarization outputs to boost diarization accuracy within individual recordings. We first propose and evaluate two novel...

Integration of TDOA features in information bottleneck framework for fast speaker diarization

In this paper we address the combination of multiple feature streams in a fast speaker diarization system for meeting recordings. Whenever Multiple Distant Microphones (MDM) are used, it is possible to estimate the Time Delay of Arrival (TDOA) for different channels. In [9], it is shown that TDOA can be used as additional features together with conventional spectral features for improving speak...

Publication date: 2014